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Abstract 



We discuss the meaning of Zipf's law in nuclear multifragmenta- 
tion. We remark that Zipf's law is a consequence of a power law 
fragment size distribution with exponent r ~ 2. We also recall why 
the presence of such distribution is not a reliable signal of a liquid-gas 
phase transition. 

PACS number(s): 25.70.Pq, 05.70.Jk, 64.60.Ak 

The search for reliable signatures of the liquid-gas phase transition in 
nuclear multifragmentation is, both theoretically and experimentally, one of 
the major issues of this field of physics. The empirical observation that the 
size distribution of heavier clusters generated in various processes satisfies 
the so called Zipf's law £Q, has raised interest and curiosity. This was first 
pointed out by Y.G. Ma [2 in the framework of the isospin dependent lattice 
gas model and more recently seen in nuclear fragmentation data 0H]. 

In the present context, Zipf's law 1 states that the mean size (mass or 
charge) s(r) of the largest, second-largest... r-largest clusters, decreases ac- 
cording to their rank r = 1,2, ■ ■ ■ ,n as 
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^dn its original formulation, Zipf's law concerns the rank of the frequency of words in 
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with A ~ 1. 

The examination of the above mentioned numerical simulations [2] and 
experimental data 0] , shows that a fairly good agreement with Eq. (1) 
is indeed obtained when the exponent is A ~ 1. This happens when other 
observables reach extreme values (maximum value of the moments of the 
cluster size distribution (c.s.d.), minimum of the effective r parameter fit of 
the c.s.d., maximum fluctuation of the largest fragment...). This seems to 
be the origin of the suggestion that the fulfillment of the Zipf 's law is a 
good signal of the liquid gas phase transition. 

The aim of this note is to point out that the finding of a Zipf's law is 
nothing but a consequence of the power law shape of the c.s.d. with exponent 
r ~ 2. More precisely, both exponents are connected through the formula 
A = l/(r-l). 

The proof of this statement is straightforward. Let be s the size (mass or 
charge) of the clusters and s(r) the size of the cluster of rank r. If the c.s.d. 
is a power low 

Pr[s G (s,s + ds)) ~ds/s T , (2) 

integrating over s, one gets the probability to find a cluster of size larger 
than S 

Pr[s > S] ~ 1/S^- 1] . (3) 

We now take S = s(r), where s(r) is the average size of clusters of rank 
r. This is a strictly decreasing real function of r, hence without degeneracy. 
On an infinite sampling, the event E, "one cluster randomly chosen has a 
size larger than s(r)" is identical to the event "his rank is larger than r". 
Arranging in ascending order the ranks from 1 to n, the probability of E is 

Pr(E) = Pr[s > s(r)] = ^—j ~ r. (4) 

On the other hand, Eq.© gives now 

Pr(E) ~ l/s(r) T_1 . 

Therefore 

r ~ l/s(r) T -\ 

and from Eq. (1), 

A = l/(r-l). (5) 
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Similar proofs can be found in the literature (see for example ref. jH], 
where the same arguments are used to proof that if the ranking follows Eq. 
(1), then the c.s.d. is necessarily a power law). 

The above formulae are strictly valid for infinite samplings. We have 
checked numerically that these remain accurate for finite samplings. We 
proceeded as follows. We generate partitions of an integer number N, with 
the condition that the mean c.s.d. is a power law of given exponent r (i.e. 
each part of iV is taken as a cluster size s). From these random numbers 
s, we construct the function s(r). As expected, for large N (N > 1000), 
the function s(r) is very close to a perfect power law in a large domain of r 
and Eq.flHJ) is satisfied within numerical uncertainties in the fits of the power 
laws. For N ~ 100 and 2 < r < 3, Eq. (JHJ) is fulfilled within a few percent. 
In general, s(l) lies above the best fit curve. This is due to the finite size. 
Indeed, in the domain of s contributing to s(l), the c.s.d. deviates from a 
power law. For the same reasons, for larger values of r the agreement is less 
good. Fig.l shows an example of s(r) calculated from a power law c.s.d. 
with r = 2.2. A best fit with Eq. (1) gives A ~ 0.86. 

It is interesting to see what happens if the c.s.d. is not a power law. For 
example, for an exponential distribution ~ exp(— as), following the same 
reasoning, one finds s(r) ~ — logr. Various theories of cluster formation 
[HI IE] offer other interesting examples. Close to their critical point, the c.s.d. 
behaves as 

n(s) ~ s-V( S "e), (6) 

where / is a scaling function, e the distance to the critical point and r, a two 
critical exponents. Right at the critical point, the scaling function f(0) = 
1. In three dimensions r ~ 2.21 for the lattice-gas model with clusters 
defined according to the Coniglio-Klein prescription |UJ [7] and r ~ 2.18 for 
percolation theory As expected, (approximate) Zipf's laws have been 
observed E] in the vicinity of the corresponding critical points. 

We present below some results for a bond-percolation calculation |8_ a with 
N = 100 occupied sites. In Fig. (2) we show as a function of the distance to 
the critical bond parameter e = p c — p, the evolution of A (upper panel) and 
the x 2 deviation of the s(r) with respect to two power law fits (lower panel). 
The continuous line (squares) is the best fit of s(r) with two parameters 
c/r x . The minimum of \ 2 occurs when A ~ 0.90 for a slope r ~ 2.16 of 
the corresponding c.s.d., in agreement with Eq.fjSJ) within 4%. The dashed- 
dotted line (crosses) corresponds to the fit with fixed A = 1 (i.e. a strict 
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Figure 1: Cluster size distribution n(s) (upper panel) and mean size s(r) 
distribution of clusters of rank r (lower panel), corresponding to a power law 
c.s.d. with r = 2.2. The size s of the clusters is generated from random 
partitions of N=100 with the constraint that the mean c.s.d. is a power law 
with exponent r = 2.2 (see text). The straight lines are best fits, with slopes 
r = 2.2 and A = 0.86, respectively. 

Zipf's law, as done in ref. 2J). This fit, that violates Eq.© by 26%, giving 
an incorrect localization of the critical point, shows as expected a larger y 2 . 
Similarly, for the lattice-gas model with Coniglio-Klein clusters one expects, 
according to Eq.(l), A ~ 0.83 0. 

Before to summarize, we would like to add a comment on the observation 
of power law c.s.d. and Zipf's laws. As mentioned before, various theories of 
cluster formation (Fischer droplets [TU], lattice-gas Lennard- Jones fluids 
with Hill's clusters jTT], percolation &j), predict power laws c.s.d. with r ~ 2 
at the corresponding critical points. However, such c.s.d. also appear else- 
where (for example, in the super-critical region of the lattice-gas and realistic 
Lennard- Jones fluids). 

In summary, (approximate) Zipf's law is just a mathematical consequence 
of a power law c.s.d. with exponent r ~ 2. Such distributions appear at 
the critical point of many theories, but also elsewhere. In consequence, we 
conclude that the observation of a Zipf's law is neither a new and independent 
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Figure 2: The slope parameter A (Eq. (1)) (upper panel) and the x 2 devi- 
ation from a power law of s(r) (lower panel), as a function of the distance 
e to the percolation critical point (see text). The continuous line (squares) 
corresponds to the best fit with A free and the dashed-dot line (crosses) with 
A = 1 Jf. 

signal of a critical behavior, nor an unambiguous signal of a thermodynamical 
phase transition. 

We thank Marc Mezard for a very helpful discussion. 
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